Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Apr 28.
Artigo em Inglês | MEDLINE | ID: mdl-38712217

RESUMO

A critical body of knowledge has developed through advances in protein microscopy, protein-fold modeling, structural biology software, availability of sequenced bacterial genomes, large-scale mutation databases, and genome-scale models. Based on these recent advances, we develop a computational framework that; i) identifies the oligomeric structural proteome encoded by an organism's genome from available structural resources; ii) maps multi-strain alleleomic variation, resulting in the structural proteome for a species; and iii) calculates the 3D orientation of proteins across subcellular compartments with residue-level precision. Using the platform, we; iv) compute the quaternary E. coli K-12 MG1655 structural proteome; v) use a dataset of 12,000 mutations to build Random Forest classifiers that can predict the severity of mutations; and, in combination with a genome-scale model that computes proteome allocation, vi) obtain the spatial allocation of the E. coli proteome. Thus, in conjunction with relevant datasets and increasingly accurate computational models, we can now annotate quaternary structural proteomes, at genome-scale, to obtain a molecular-level understanding of whole-cell functions. Significance: Advancements in experimental and computational methods have revealed the shapes of multi-subunit proteins. The absence of a unified platform that maps actionable datatypes onto these increasingly accurate structures creates a barrier to structural analyses, especially at the genome-scale. Here, we describe QSPACE, a computational annotation platform that evaluates existing resources to identify the best-available structure for each protein in a user's query, maps the 3D location of actionable datatypes ( e.g. , active sites, published mutations) onto the selected structures, and uses third-party APIs to determine the subcellular compartment of all amino acids of a protein. As proof-of-concept, we deployed QSPACE to generate the quaternary structural proteome of E. coli MG1655 and demonstrate two use-cases involving large-scale mutant analysis and genome-scale modelling.

2.
Res Sq ; 2023 May 19.
Artigo em Inglês | MEDLINE | ID: mdl-37292890

RESUMO

A critical body of knowledge has developed through advances in protein microscopy, protein-fold modeling, structural biology software, availability of sequenced bacterial genomes, large-scale mutation databases, and genome-scale models. Based on these recent advances, we develop a computational platform that; i) computes the oligomeric structural proteome encoded by an organism's genome; ii) maps multi-strain alleleomic variation, resulting in the structural proteome for a species; and iii) calculates the 3D orientation of proteins across subcellular compartments with angstrom-level precision. Using the platform, we; iv) compute the full quaternary E. coli K-12 MG1655 structural proteome; v) deploy structure-guided analyses to identify consequential mutations; and, in combination with a genome-scale model that computes proteome allocation, vi) obtain a draft 3D visualization of the proteome in a functioning cell. Thus, in conjunction with relevant datasets and computational models, we can now resolve genome-scale structural proteomes to obtain an angstrom-level understanding of whole-cell functions.

3.
Mol Syst Biol ; 17(7): e10099, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-34288418

RESUMO

Mesoplasma florum, a fast-growing near-minimal organism, is a compelling model to explore rational genome designs. Using sequence and structural homology, the set of metabolic functions its genome encodes was identified, allowing the reconstruction of a metabolic network representing ˜ 30% of its protein-coding genes. Growth medium simplification enabled substrate uptake and product secretion rate quantification which, along with experimental biomass composition, were integrated as species-specific constraints to produce the functional iJL208 genome-scale model (GEM) of metabolism. Genome-wide expression and essentiality datasets as well as growth data on various carbohydrates were used to validate and refine iJL208. Discrepancies between model predictions and observations were mechanistically explained using protein structures and network analysis. iJL208 was also used to propose an in silico reduced genome. Comparing this prediction to the minimal cell JCVI-syn3.0 and its parent JCVI-syn1.0 revealed key features of a minimal gene set. iJL208 is a stepping-stone toward model-driven whole-genome engineering.


Assuntos
Genoma , Redes e Vias Metabólicas , Genoma/genética , Genômica , Redes e Vias Metabólicas/genética , Modelos Biológicos
4.
PLoS Comput Biol ; 17(1): e1008596, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-33465077

RESUMO

The fitness landscape is a concept commonly used to describe evolution towards optimal phenotypes. It can be reduced to mechanistic detail using genome-scale models (GEMs) from systems biology. We use recently developed GEMs of Metabolism and protein Expression (ME-models) to study the distribution of Escherichia coli phenotypes on the rate-yield plane. We found that the measured phenotypes distribute non-uniformly to form a highly stratified fitness landscape. Systems analysis of the ME-model simulations suggest that this stratification results from discrete ATP generation strategies. Accordingly, we define "aero-types", a phenotypic trait that characterizes how a balanced proteome can achieve a given growth rate by modulating 1) the relative utilization of oxidative phosphorylation, glycolysis, and fermentation pathways; and 2) the differential employment of electron-transport-chain enzymes. This global, quantitative, and mechanistic systems biology interpretation of fitness landscape formed upon proteome allocation offers a fundamental understanding of bacterial physiology and evolution dynamics.


Assuntos
Escherichia coli , Aptidão Genética/genética , Proteoma , Trifosfato de Adenosina/metabolismo , Escherichia coli/classificação , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Evolução Molecular , Regulação Bacteriana da Expressão Gênica/genética , Genoma Bacteriano/genética , Modelos Genéticos , Nitratos/metabolismo , Fenótipo , Proteoma/genética , Proteoma/metabolismo , Biologia de Sistemas
5.
BMC Bioinformatics ; 21(1): 162, 2020 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-32349661

RESUMO

BACKGROUND: The reconstruction of metabolic networks and the three-dimensional coverage of protein structures have reached the genome-scale in the widely studied Escherichia coli K-12 MG1655 strain. The combination of the two leads to the formation of a structural systems biology framework, which we have used to analyze differences between the reactive oxygen species (ROS) sensitivity of the proteomes of sequenced strains of E. coli. As proteins are one of the main targets of oxidative damage, understanding how the genetic changes of different strains of a species relates to its oxidative environment can reveal hypotheses as to why these variations arise and suggest directions of future experimental work. RESULTS: Creating a reference structural proteome for E. coli allows us to comprehensively map genetic changes in 1764 different strains to their locations on 4118 3D protein structures. We use metabolic modeling to predict basal ROS production levels (ROStype) for 695 of these strains, finding that strains with both higher and lower basal levels tend to enrich their proteomes with antioxidative properties, and speculate as to why that is. We computationally assess a strain's sensitivity to an oxidative environment, based on known chemical mechanisms of oxidative damage to protein groups, defined by their localization and functionality. Two general groups - metalloproteins and periplasmic proteins - show enrichment of their antioxidative properties between the 695 strains with a predicted ROStype as well as 116 strains with an assigned pathotype. Specifically, proteins that a) utilize a molybdenum ion as a cofactor and b) are involved in the biogenesis of fimbriae show intriguing protective properties to resist oxidative damage. Overall, these findings indicate that a strain's sensitivity to oxidative damage can be elucidated from the structural proteome, though future experimental work is needed to validate our model assumptions and findings. CONCLUSION: We thus demonstrate that structural systems biology enables a proteome-wide, computational assessment of changes to atomic-level physicochemical properties and of oxidative damage mechanisms for multiple strains in a species. This integrative approach opens new avenues to study adaptation to a particular environment based on physiological properties predicted from sequence alone.


Assuntos
Adaptação Fisiológica , Escherichia coli K12/fisiologia , Estresse Oxidativo , Proteoma/metabolismo , Antioxidantes/metabolismo , Proteínas de Escherichia coli/metabolismo , Fímbrias Bacterianas/metabolismo , Modelos Biológicos , Molibdênio/metabolismo , Óperon/genética , Oxirredução , Periplasma/metabolismo , Fenótipo , Espécies Reativas de Oxigênio/metabolismo
6.
Protein Sci ; 29(3): 711-722, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31811683

RESUMO

Galactarate dehydratase (GarD) is the first enzyme in the galactarate/glucarate pathway and catalyzes the dehydration of galactarate to 3-keto-5-dehydroxygalactarate. This protein is known to increase colonization fitness of intestinal pathogens in antibiotic-treated mice and to promote bacterial survival during stress. The galactarate/glucarate pathway is widespread in bacteria, but not in humans, and thus could be a target to develop new inhibitors for use in combination therapy to combat antibiotic resistance. The structure of almost all the enzymes of the galactarate/glucarate pathway were solved previously, except for GarD, for which only the structure of the N-terminal domain was determined previously. Herein, we report the first crystal structure of full-length GarD solved using a seleno-methoionine derivative revealing a new protein fold. The protein consists of three domains, each presenting a novel twist as compared to their distant homologs. GarD in the crystal structure forms dimers and each monomer consists of three domains. The N-terminal domain is comprised of a ß-clip fold, connected to the second domain by a long unstructured linker. The second domain serves as a dimerization interface between two monomers. The C-terminal domain forms an unusual variant of a Rossmann fold with a crossover and is built around a seven-stranded parallel ß-sheet supported by nine α-helices. A metal binding site in the C-terminal domain is occupied by Ca2+ . The activity of GarD was corroborated by the production of 5-keto-4-deoxy-D-glucarate under reducing conditions and in the presence of iron. Thus, GarD is an unusual enolase with a novel protein fold never previously seen in this class of enzymes.


Assuntos
Antibacterianos/farmacologia , Bactérias/efeitos dos fármacos , Bactérias/enzimologia , Hidroliases/química , Fosfopiruvato Hidratase/química , Cristalografia por Raios X , Hidroliases/metabolismo , Testes de Sensibilidade Microbiana , Modelos Moleculares , Estrutura Molecular , Fosfopiruvato Hidratase/metabolismo , Dobramento de Proteína/efeitos dos fármacos
7.
Mol Syst Biol ; 15(11): e8601, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31777175

RESUMO

Biology is reaching a convergence point of its historic reductionist and modern holistic approaches to understanding the living system. Structural biology has historically taken the reductionist approach to deeply probe the inner workings of complex molecular machines. In contrast, systems biology and genome-scale modeling have organically grown out of the wealth of data now being generated by diverse omics measurements. In the late 2000s, a proposed interdisciplinary field of structural systems biology pitched the merger of these two approaches, with widespread applications in pharmacology, disease modeling, protein engineering, and evolutionary studies. In this commentary, we highlight the challenges of integrating these two fields, with a focus on genome-scale metabolic modeling, and the novel findings that are made possible from such a merger.


Assuntos
Genoma , Biologia de Sistemas/métodos , Animais , Humanos , Modelos Biológicos , Conformação Proteica , Proteínas/química
8.
Proc Natl Acad Sci U S A ; 116(28): 14368-14373, 2019 07 09.
Artigo em Inglês | MEDLINE | ID: mdl-31270234

RESUMO

Catalysis using iron-sulfur clusters and transition metals can be traced back to the last universal common ancestor. The damage to metalloproteins caused by reactive oxygen species (ROS) can prevent cell growth and survival when unmanaged, thus eliciting an essential stress response that is universal and fundamental in biology. Here we develop a computable multiscale description of the ROS stress response in Escherichia coli, called OxidizeME. We use OxidizeME to explain four key responses to oxidative stress: 1) ROS-induced auxotrophy for branched-chain, aromatic, and sulfurous amino acids; 2) nutrient-dependent sensitivity of growth rate to ROS; 3) ROS-specific differential gene expression separate from global growth-associated differential expression; and 4) coordinated expression of iron-sulfur cluster (ISC) and sulfur assimilation (SUF) systems for iron-sulfur cluster biosynthesis. These results show that we can now develop fundamental and quantitative genotype-phenotype relationships for stress responses on a genome-wide basis.


Assuntos
Proteínas Ferro-Enxofre/genética , Ferro/metabolismo , Metaloproteínas/genética , Espécies Reativas de Oxigênio/metabolismo , Catálise , Proliferação de Células/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Regulação da Expressão Gênica/genética , Peróxido de Hidrogênio/metabolismo , Óperon/genética , Estresse Oxidativo/genética , Enxofre/metabolismo
9.
PLoS Comput Biol ; 15(1): e1006644, 2019 01.
Artigo em Inglês | MEDLINE | ID: mdl-30625152

RESUMO

S. aureus is classified as a serious threat pathogen and is a priority that guides the discovery and development of new antibiotics. Despite growing knowledge of S. aureus metabolic capabilities, our understanding of its systems-level responses to different media types remains incomplete. Here, we develop a manually reconstructed genome-scale model (GEM-PRO) of metabolism with 3D protein structures for S. aureus USA300 str. JE2 containing 854 genes, 1,440 reactions, 1,327 metabolites and 673 3-dimensional protein structures. Computations were in 85% agreement with gene essentiality data from random barcode transposon site sequencing (RB-TnSeq) and 68% agreement with experimental physiological data. Comparisons of computational predictions with experimental observations highlight: 1) cases of non-essential biomass precursors; 2) metabolic genes subject to transcriptional regulation involved in Staphyloxanthin biosynthesis; 3) the essentiality of purine and amino acid biosynthesis in synthetic physiological media; and 4) a switch to aerobic fermentation upon exposure to extracellular glucose elucidated as a result of integrating time-course of quantitative exo-metabolomics data. An up-to-date GEM-PRO thus serves as a knowledge-based platform to elucidate S. aureus' metabolic response to its environment.


Assuntos
Meios de Cultura , Genoma Bacteriano/genética , Staphylococcus aureus , Biologia de Sistemas/métodos , Meios de Cultura/metabolismo , Meios de Cultura/farmacologia , Regulação Bacteriana da Expressão Gênica/efeitos dos fármacos , Regulação Bacteriana da Expressão Gênica/genética , Bases de Conhecimento , Redes e Vias Metabólicas/efeitos dos fármacos , Redes e Vias Metabólicas/genética , Metaboloma/efeitos dos fármacos , Metaboloma/genética , Metabolômica , Modelos Biológicos , Staphylococcus aureus/efeitos dos fármacos , Staphylococcus aureus/genética , Staphylococcus aureus/metabolismo , Staphylococcus aureus/fisiologia
10.
Biophys J ; 116(2): 205-214, 2019 01 22.
Artigo em Inglês | MEDLINE | ID: mdl-30606449

RESUMO

The atomic-level mechanisms that coordinate ligand release from protein pockets are only known for a handful of proteins. Here, we report results from accelerated molecular dynamics simulations for benzene dissociation from the buried cavity of the T4 lysozyme Leu99Ala mutant (L99A). In these simulations, benzene is released through a previously characterized, sparsely populated room-temperature excited state of the mutant, explaining the coincidence for experimentally measured benzene off rate and apo protein slow-timescale NMR relaxation rates between ground and excited states. The path observed for benzene egress is a multistep ligand migration from the buried cavity to ultimate release through an opening between the F/G-, H-, and I-helices and requires a number of cooperative multiresidue and secondary-structure rearrangements within the C-terminal domain of L99A. These rearrangements are identical to those observed along the ground state to excited state transitions characterized by molecular dynamic simulations run on the Anton supercomputer. Analyses of the molecular properties of the residues lining the egress path suggest that protein surface electrostatic potential may play a role in the release mechanism. Simulations of wild-type T4 lysozyme also reveal that benzene-egress-associated dynamics in the L99A mutant are potentially exaggerations of the substrate-processivity-related dynamics of the wild type.


Assuntos
Benzeno/química , Simulação de Dinâmica Molecular , Muramidase/química , Substituição de Aminoácidos , Sítios de Ligação , Simulação de Acoplamento Molecular , Muramidase/genética , Muramidase/metabolismo , Ligação Proteica , Eletricidade Estática
11.
Nat Commun ; 9(1): 5252, 2018 12 07.
Artigo em Inglês | MEDLINE | ID: mdl-30531987

RESUMO

Knowing the catalytic turnover numbers of enzymes is essential for understanding the growth rate, proteome composition, and physiology of organisms, but experimental data on enzyme turnover numbers is sparse and noisy. Here, we demonstrate that machine learning can successfully predict catalytic turnover numbers in Escherichia coli based on integrated data on enzyme biochemistry, protein structure, and network context. We identify a diverse set of features that are consistently predictive for both in vivo and in vitro enzyme turnover rates, revealing novel protein structural correlates of catalytic turnover. We use our predictions to parameterize two mechanistic genome-scale modelling frameworks for proteome-limited metabolism, leading to significantly higher accuracy in the prediction of quantitative proteome data than previous approaches. The presented machine learning models thus provide a valuable tool for understanding metabolism and the proteome at the genome scale, and elucidate structural, biochemical, and network properties that underlie enzyme kinetics.


Assuntos
Proteínas de Escherichia coli/metabolismo , Escherichia coli/enzimologia , Aprendizado de Máquina , Redes e Vias Metabólicas , Algoritmos , Biocatálise , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Cinética , Modelos Biológicos , Proteoma/genética , Proteoma/metabolismo
12.
Nat Commun ; 9(1): 4306, 2018 10 17.
Artigo em Inglês | MEDLINE | ID: mdl-30333483

RESUMO

Mycobacterium tuberculosis is a serious human pathogen threat exhibiting complex evolution of antimicrobial resistance (AMR). Accordingly, the many publicly available datasets describing its AMR characteristics demand disparate data-type analyses. Here, we develop a reference strain-agnostic computational platform that uses machine learning approaches, complemented by both genetic interaction analysis and 3D structural mutation-mapping, to identify signatures of AMR evolution to 13 antibiotics. This platform is applied to 1595 sequenced strains to yield four key results. First, a pan-genome analysis shows that M. tuberculosis is highly conserved with sequenced variation concentrated in PE/PPE/PGRS genes. Second, the platform corroborates 33 genes known to confer resistance and identifies 24 new genetic signatures of AMR. Third, 97 epistatic interactions across 10 resistance classes are revealed. Fourth, detailed structural analysis of these genes yields mechanistic bases for their selection. The platform can be used to study other human pathogens.


Assuntos
Farmacorresistência Bacteriana/genética , Genoma Bacteriano , Aprendizado de Máquina , Mycobacterium tuberculosis/genética , Frequência do Gene , Seleção Genética
13.
Nucleic Acids Res ; 46(20): 10682-10696, 2018 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-30137486

RESUMO

Transcriptional regulation enables cells to respond to environmental changes. Of the estimated 304 candidate transcription factors (TFs) in Escherichia coli K-12 MG1655, 185 have been experimentally identified, but ChIP methods have been used to fully characterize only a few dozen. Identifying these remaining TFs is key to improving our knowledge of the E. coli transcriptional regulatory network (TRN). Here, we developed an integrated workflow for the computational prediction and comprehensive experimental validation of TFs using a suite of genome-wide experiments. We applied this workflow to (i) identify 16 candidate TFs from over a hundred uncharacterized genes; (ii) capture a total of 255 DNA binding peaks for ten candidate TFs resulting in six high-confidence binding motifs; (iii) reconstruct the regulons of these ten TFs by determining gene expression changes upon deletion of each TF and (iv) identify the regulatory roles of three TFs (YiaJ, YdcI, and YeiE) as regulators of l-ascorbate utilization, proton transfer and acetate metabolism, and iron homeostasis under iron-limited conditions, respectively. Together, these results demonstrate how this workflow can be used to discover, characterize, and elucidate regulatory functions of uncharacterized TFs in parallel.


Assuntos
Escherichia coli K12/genética , Proteínas de Escherichia coli/genética , Perfilação da Expressão Gênica , Fatores de Transcrição/genética , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/metabolismo , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Fatores de Transcrição/metabolismo
14.
Front Microbiol ; 9: 1082, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29887846

RESUMO

Two-component systems (TCSs) consist of a histidine kinase and a response regulator. Here, we evaluated the conservation of the AgrAC TCS among 149 completely sequenced Staphylococcus aureus strains. It is composed of four genes: agrBDCA. We found that: (i) AgrAC system (agr) was found in all but one of the 149 strains, (ii) the agr positive strains were further classified into four agr types based on AgrD protein sequences, (iii) the four agr types not only specified the chromosomal arrangement of the agr genes but also the sequence divergence of AgrC histidine kinase protein, which confers signal specificity, (iv) the sequence divergence was reflected in distinct structural properties especially in the transmembrane region and second extracellular binding domain, and (v) there was a strong correlation between the agr type and the virulence genomic profile of the organism. Taken together, these results demonstrate that bioinformatic analysis of the agr locus leads to a classification system that correlates with the presence of virulence factors and protein structural properties.

15.
BMC Syst Biol ; 12(1): 66, 2018 06 11.
Artigo em Inglês | MEDLINE | ID: mdl-29890970

RESUMO

BACKGROUND: Escherichia coli is considered a leading bacterial trigger of inflammatory bowel disease (IBD). E. coli isolates from IBD patients primarily belong to phylogroup B2. Previous studies have focused on broad comparative genomic analysis of E. coli B2 isolates, and identified virulence factors that allow B2 strains to reside within human intestinal mucosa. Metabolic capabilities of E. coli strains have been shown to be related to their colonization site, but remain unexplored in IBD-associated strains. RESULTS: In this study, we utilized pan-genome analysis and genome-scale models (GEMs) of metabolism to study metabolic capabilities of IBD-associated E. coli B2 strains. The study yielded three results: i) Pan-genome analysis of 110 E. coli strains (including 53 isolates from IBD studies) revealed discriminating metabolic genes between B2 strains and other strains; ii) Both comparative genomic analysis and GEMs suggested that B2 strains have an advantage in degrading and utilizing sugars derived from mucus glycan, and iii) GEMs revealed distinct metabolic features in B2 strains that potentially allow them to utilize energy more efficiently. For example, B2 strains lack the enzymes to degrade amadori products, but instead rely on neighboring bacteria to convert these substrates into a more readily usable and potentially less sought after product. CONCLUSIONS: Taken together, these results suggest that the metabolic capabilities of B2 strains vary significantly from those of other strains, enabling B2 strains to colonize intestinal mucosa.The results from this study motivate a broad experimental assessment of the nutritional effects on E. coli B2 pathophysiology in IBD patients.


Assuntos
Escherichia coli/metabolismo , Doenças Inflamatórias Intestinais/microbiologia , Mucosa Intestinal/microbiologia , Escherichia coli/genética , Escherichia coli/fisiologia , Genômica , Humanos
16.
Bioinformatics ; 34(12): 2155-2157, 2018 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-29444205

RESUMO

Summary: Working with protein structures at the genome-scale has been challenging in a variety of ways. Here, we present ssbio, a Python package that provides a framework to easily work with structural information in the context of genome-scale network reconstructions, which can contain thousands of individual proteins. The ssbio package provides an automated pipeline to construct high quality genome-scale models with protein structures (GEM-PROs), wrappers to popular third-party programs to compute associated protein properties, and methods to visualize and annotate structures directly in Jupyter notebooks, thus lowering the barrier of linking 3D structural data with established systems workflows. Availability and implementation: ssbio is implemented in Python and available to download under the MIT license at http://github.com/SBRG/ssbio. Documentation and Jupyter notebook tutorials are available at http://ssbio.readthedocs.io/en/latest/. Interactive notebooks can be launched using Binder at https://mybinder.org/v2/gh/SBRG/ssbio/master?filepath=Binder.ipynb. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Modelos Biológicos , Conformação Proteica , Software
17.
Nat Biotechnol ; 36(3): 272-281, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29457794

RESUMO

Genome-scale network reconstructions have helped uncover the molecular basis of metabolism. Here we present Recon3D, a computational resource that includes three-dimensional (3D) metabolite and protein structure data and enables integrated analyses of metabolic functions in humans. We use Recon3D to functionally characterize mutations associated with disease, and identify metabolic response signatures that are caused by exposure to certain drugs. Recon3D represents the most comprehensive human metabolic network model to date, accounting for 3,288 open reading frames (representing 17% of functionally annotated human genes), 13,543 metabolic reactions involving 4,140 unique metabolites, and 12,890 protein structures. These data provide a unique resource for investigating molecular mechanisms of human metabolism. Recon3D is available at http://vmh.life.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Redes e Vias Metabólicas/genética , Bases de Dados Genéticas , Humanos , Internet , Anotação de Sequência Molecular , Fases de Leitura Aberta/genética
18.
Proc Natl Acad Sci U S A ; 114(43): 11548-11553, 2017 10 24.
Artigo em Inglês | MEDLINE | ID: mdl-29073085

RESUMO

Maintenance of a properly folded proteome is critical for bacterial survival at notably different growth temperatures. Understanding the molecular basis of thermoadaptation has progressed in two main directions, the sequence and structural basis of protein thermostability and the mechanistic principles of protein quality control assisted by chaperones. Yet we do not fully understand how structural integrity of the entire proteome is maintained under stress and how it affects cellular fitness. To address this challenge, we reconstruct a genome-scale protein-folding network for Escherichia coli and formulate a computational model, FoldME, that provides statistical descriptions of multiscale cellular response consistent with many datasets. FoldME simulations show (i) that the chaperones act as a system when they respond to unfolding stress rather than achieving efficient folding of any single component of the proteome, (ii) how the proteome is globally balanced between chaperones for folding and the complex machinery synthesizing the proteins in response to perturbation, (iii) how this balancing determines growth rate dependence on temperature and is achieved through nonspecific regulation, and (iv) how thermal instability of the individual protein affects the overall functional state of the proteome. Overall, these results expand our view of cellular regulation, from targeted specific control mechanisms to global regulation through a web of nonspecific competing interactions that modulate the optimal reallocation of cellular resources. The methodology developed in this study enables genome-scale integration of environment-dependent protein properties and a proteome-wide study of cellular stress responses.


Assuntos
Proteínas de Escherichia coli/metabolismo , Escherichia coli/fisiologia , Proteínas de Choque Térmico/metabolismo , Simulação por Computador , Proteínas de Escherichia coli/genética , Regulação Bacteriana da Expressão Gênica/fisiologia , Temperatura Alta , Humanos , Modelos Químicos , Dobramento de Proteína , Proteoma
20.
Proc Natl Acad Sci U S A ; 114(38): 10286-10291, 2017 09 19.
Artigo em Inglês | MEDLINE | ID: mdl-28874552

RESUMO

Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN-probably the best characterized TRN-several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism's TRN from disparate data types.


Assuntos
Escherichia coli/metabolismo , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Fatores de Transcrição/metabolismo , Escherichia coli/genética , Transcriptoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...